13 research outputs found

Linear, Deterministic, and Order-Invariant Initialization Methods for the K-Means Clustering Algorithm

Over the past five decades, k-means has become the clustering algorithm of choice in many application domains, primarily due to its simplicity, time/space efficiency, and invariance to the ordering of the data points. Unfortunately, the algorithm's sensitivity to the initial selection of the cluster centers remains its most serious drawback. Numerous initialization methods have been proposed to address this drawback. Many of these methods, however, have time complexity superlinear in the number of data points, which makes them impractical for large data sets. On the other hand, linear methods are often random and/or sensitive to the ordering of the data points. These methods are generally unreliable in that the quality of their results is unpredictable. Therefore, it is common practice to perform multiple runs of such methods and take the output of the run that produces the best results. Such a practice, however, greatly increases the computational requirements of the otherwise highly efficient k-means algorithm. In this chapter, we investigate the empirical performance of six linear, deterministic (non-random), and order-invariant k-means initialization methods on a large and diverse collection of data sets from the UCI Machine Learning Repository. The results demonstrate that two relatively unknown hierarchical initialization methods due to Su and Dy outperform the remaining four methods with respect to two objective effectiveness criteria. In addition, a recent method due to Erisoglu et al. performs surprisingly poorly.

Comment: 21 pages, 2 figures, 5 tables, Partitional Clustering Algorithms (Springer, 2014). arXiv admin note: substantial text overlap with arXiv:1304.7465, arXiv:1209.196

arXiv.org e-Print Archive

The application of numerical methods of data analysis to the genus Phyllota Benth. in New South Wales

Author: RC Jancey
Publication venue: 'CSIRO Publishing'
Publication date: 01/01/1966

Multidimensional group analysis

Author: RC Jancey
Publication venue: 'CSIRO Publishing'
Publication date: 01/01/1966

Multivariate analysis of polypeptide synthesis in field-grown maize inbreds and hybrids

Author: CL Baszczynski
DE Comings
JG Boothe
L Orloci
PH Fewster
PH O'Farrell
RC Jancey
RJ Mans
TG Crowe
WG Hughes
WM Bonner
Publication venue: 'Springer Science and Business Media LLC'

Popmusic — Partial Optimization Metaheuristic under Special Intensification Conditions

Author: A Mason
D Applegate
DL Woodruff
E Angel
F Glover
FF Ehrich
G Laporte
G Reinelt
H Späth
J Brimberg
K Büdenbender
L Sondergeld
MJ Goodwin
MR Garey
MS Darlow
P Hansen
P Shaw
RC Jancey
T Mautor
WJ Conover
Y Rochat
É Taillard
ÉD Taillard
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2002

Biography after Historicism: The Harley Lyrics, the Hereford Map, and the Life of Roger De Breynton

Author: C Revard
D Lawton
D Novarr
D Terkla
D Woodward
EE Stoll
FH Ellis
G Alington
GA Usher
HD Emanuel
J Boffey
J Catto
JB Margadant
JR Hale
K Aegidius
M Jancey
NR Ker
PDA Harvey
R Bartlett
R Girard
R Monk
RC Finucane
RM Haines
S Fein
SD Westrem
VE Wylie
WH Epstein
WJ Courtenay
WJ Dohar
WL Bevan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009

Balancing effort and benefit of K-means clustering algorithms in Big Data realms

Author: A Konak
AK Jain
AM Fahim
B Hans-Hermann
C Ming-Chao
CC McGeoch
D Aloise
David Romero
DH Fisher
DH Wolpert
G Tzortzis
H Steinhaus
J Pérez
Joaquín Pérez-Ortega
JZ Lai
K Kambatla
M El Agha
M Kantardzic
M Mahajan
Nelva Nely Almanza-Ortega
R Salman
RC Jancey
RO Duda
S Lloyd
SZ Selim
T Chun-Wei
T Kanungo
X Zhanguo
YK Lam
Yong Deng
YP Raykov
Publication venue: 'Public Library of Science (PLoS)'

Improving walking conditions for older adults. A three-step method investigation

Author: A Stathi
A Ståhl
AB King
Aud Tennøy
B Row
D Sumukadas
E Allardt
F Ross
H Wennberg
H-W Wahl
HC Borst
J Reed
JM Jancey
Julie Runde Krogstad
JV Cauwenberg
KS Morris
L Drewes Nielsen
L Frank
L Levin
M Moran
MA Alfonzo
ME Nelson
ML Booth
OECD
PL Mokhtarian
R Hjorthol
R Hjorthol
Randi Hjorthol
RC Brownson
S Ringen
SL Hughes
T Sugiyama
TF Golob
Z Wang
Publication venue: 'Springer Science and Business Media LLC'

Deviation between self-reported and measured occupational physical activity levels in office employees: effects of age and body composition

Author: A Cohen
A Sanchez
AA Thorp
AC Grunseit
B Wallmann-Sperlich
BE Ainsworth
BM Kurth
C Tudor-Locke
C Watkinson
CL Craig
D Baty
DC Park
DM Ditchen
E Laperrière
E Sluijs van
HE Brown
J Cohen
J Jancey
J Li
J Skotte
JA Baecke
JA Steeves
JG Uffelen van
JL Fleiss
JY Chau
K Cocker De
Katharina Wick
L Choi
Lars Donath
Lukas Zahner
M Castillo-Retamal
MA Koeneman
MC Kao
MJ Smith
MS Bernstein
MT Hamilton
N Shrestha
O Tikkanen
Oliver Faude
R Durante
RC Plotnikoff
RP Troiano
S Ijmker
SA Adams
SA Clemes
SA Prince
SN Blair
Susanne Schwager
U Ellert
Publication venue: 'Springer Science and Business Media LLC'

New clustering methods for population comparison on paternal lineages

Author: A Diaz-Lacava
A Sanchez-Mazas
A Zupan
A. Zalán
AZ Bíró
C Capelli
DB Goldstein
E. Németh
EP Lessa
F Cruciani
G Childe
G. Bárány
H Pamjav
H. Pamjav
I Borg
I Morozova
J Chiaroni
J Felsenstein
JB Kruskal
JC Bezdek
JX She
K Karun
L Chikhi
L Excoffier
LL Cavalli-Sforza
M Ester
M Kimura
M Nei
M Slatkin
MA Jobling
N Ray
NM Myres
O Balanovsky
P Demartines
P Tamayo
R Nock
R Scozzari
RC Jancey
RR Hudson
S Kanaya
S Mirabal
S Rootsi
T Gayden
T Jombart
T Kanungo
T Kohonen
T. Fehér
V Grugni
VN Kharkov
WH Li
Z Juhász
Z. Juhász
Z. Pádár
Publication venue: 'Springer Science and Business Media LLC'
